TITLE 

PROMOTER AND REGULATOR SEQUENCES Ha dslO Gl: A GENE LEA OF 
SUNFLOWER EXCLUSIVELY EXPRESSED IN SEEDS SINCE THE 
MATURATION PHASE. 

5 

TECHNICAL SECTOR 

Agriculture. The present invention refers to the obtaining of regulatory 
("promoter") DNA sequences and the construction of new chimeric genes, and to the use 
of these sequences, capable of being specifically expressed in transgenic plant seeds. Ha 

10 dslO Gl gene has the peculiarity of only being expressed in sunflower seeds from the 
maturation until the desiccation phase, without responding to hormones such as abscicic 
acid (ABA) or water stress in vegetative tissues. Furthermore, gene Ha dslO Gl is 
expressed homogeneously in inmiature embryos and preferably in the palisade 
parenchyma of mature embryo cotyledons. These expression pattems, as well as the high 

15 activity levels of the gene, suggest that its regulatory sequences are particularly 
appropriate for the genetic manipulation of storage substances in seeds. 

PRIOR ART 

Up to now in order to confer specific expression in transgenic plant seeds, 
20 promoters have been isolated, characterised and used, especially belonging to plant genes 
which code for storage proteins or other products solely expressed in seeds, during 
different phases of development [see the following references and patents, as well as 
other documents cited therein Thomas TL, in Plant Cell, vol 5, pp 1401-1410, 1993; 
Gatehouse JA and Shirsat AH in Control of Plant Gene Expression, pp 357-375, CRC 
25 press, 1993; US-5,530,192, US-5,530,194 and US-5 ,420,034]. For example, this has 



allowed the obtaining of new transgenic plants with modified fatty acid and storage 
protein content [see: Voelker TA, Worrell AC, Anderson L, Bleibaum J, Fan C, Hawkins 
DJ, Radke SE and Davies HM, in Science, vol. 257, pp.72-74, 1992; and Saalbach I, 
Pickardt T, Machemehl F, Saalbach G, Schieder O, and Muntz K, in Molecular and 
General Genetics 242: 226-236, 1994]. Other promoters with different tissue specificity 
in seed and varied temporal expression patterns could be useful for the development of 
the enormous potential of this technique. Recently the expression in seeds of genes that 
code for low molecular weight heat shock proteins was described (sHSPs: small heat- 
shock proteins). One of these genes. Ha hspllJ G4, shows in tobacco transgenic plants, 
expression patterns appropriate for its possible use in the genetically engineered 
modification of seeds: this gene is expressed from early seed maturation phases, and is 
specific of cotyledon tissue [Coca MA, Almoguera C, Thomas TL and Jordano J, in: 
Plant Molecular Biology 31: 863-876, 1996]. However, gene Ha hsplJJ G4, like other 
sHSP plant genes expressed in seeds, is also expressed in response to heat (heat shock) in 
plant vegetative tissues after seed germination. The latter makes its use in genetic 
engineering impossible in the case that regulatory DNA sequences that guarantee the 
absence of expression of chimeric genes outside of the seed are required: for example, 
when the expression elsewhere of these genes may affect the viability, the growth, or the 
health of the transgenic plants. To solve these problems the Hahspl7.7 G4 gene 
regulatory sequences were modified in such a way that the chimeric genes containing 
these sequences maintain their expression in seeds, and lose their heat induction; a 
procedure which can be used for the modification and similar use of regulatory sequences 
of other sHSP genes expressed in seed [Almoguera, Prieto-Dapena and Jordano, patent 
request #9602746 (Spanish Patent Office)]. Alternatively, a similar use for the promoter 
and regulatory sequences of the sunflower gene Ha hspl7,6 Gl, which is only expressed 
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in seeds, is proposed. This gene does not respond to heat or other types of stress (cold, 
dehydration, ABA hormone treatment) in vegetative tissues [Carranco, Almoguera and 
Jordano, Spanish patent appHcation #970121. 

Alternative analogous uses for promoter and regulatory sequences of sunflower 
5 LEA Ha dslO Gl gene are proposed. Gene Ha dslO Gl has been found in a genomic 
clone corresponding to a previously described cDNA {Ha dslO, access number X506999) 
whose expression patterns were not totally known [Almoguera and Jordano, Plant MoL 
Biol. 19:781-792, 1992]. The promoter and regulatory sequences of this gene (Ha dslO 
Gl) have been cloned and are described, characterised and used for the first time in the 

10 examples of this application. The Ha dslO Gl gene belongs to the Class I LEA (Late 
Embryogenesis Abundant) gene family (D-19 or LEA-I type ) These genes code for 
highly conserved proteins in various plant species, and their expression is usually 
restricted to seeds and early germination phases [see for example the following reviews: 
Dure in, L., Structural motifs in Lea proteins^ in Plant Responses to Plant Dehydration 

15 During Environmental Stress,, Close TJ and Bray EA Eds., Current Topics in Plant 
Physiology 10: 91-103, 1993; and Delseny M, Gaubier P, Hull G, Saez-Vasquez J, 
Gallois P, Raynal M, Cooke R, Grellet F., Nuclear Genes expressed during seed 
desiccation: relationship with responses to stress , in Stress-induced Gene Expression in 
Plants (Basra, A. S., ed.), pp. 25-59, Harwood Academic Publishers, Reading, 1994]. 

20 LEA gene promoters have not been considered so good candidates for their use in seed 
storage substance modification projects, since usually their activity is expressed in later 
seed maturation phases, such as embryo desiccation [Kridls JC, Knauf VC, Thompson Ga 
in Control of Plant Gene Expression, pp. 481-498, CRC press, 1993]. However, LEA 
genes that are activated in maturation phases prior to desiccation are known, such as the 

25 cotton genes denominated LEA-A [Hughes DW and Galau GA, The Plant Cell 3:605- 



618, 1991]. Examples of activation prior to desiccation are also known in the class I LEA 
genes, such as in the case of At Eml, emb564 and embl genes [in arabidopsis, maize and 
carrot, respectively: Gaubier P, Raynal M, Hull G, Huestis GM, Grellet F, Arenas C, 
Pages M, and Delseny M, Mol. Gen. Genet., 238: 409-418, 1993; Williams B, and Tsang 
A, Plant Mol Biol, 16: 919-923, 1991; Wurtele ES, Wang H, Durgerian S, Nikolau BJ, 
and Ulrich TH. Plant Physiol 102:303-312, 1993]. These examples seem to indicate the 
possible use of regulatory sequences from genes of this family, for the modification of 
seeds. However, its specific use would be limited both by the expression levels obtained 
in each case and in each development phase; as well as by the different tissue specificity. 
Thus, even though in Arabidopsis the At Eml gene is early activated, its expression is 
basically restricted to cotyledon provascular tissue and cortical tissue external to the 
embryonic axis [Gaubier, P., Raynal, M., Hull, G., Huestis, GM., Grellet, F., Arenas, C., 
Pages, M., and Delseny, M., Mol Gen, Genet., 238: 409-418, 1993]. In the case of the 
carrot gene, embl, its mRNA are preferably locaUsed in the embryonic meristems, 
especially in the procambium [Wurtele ES, Wang H, Durgerian S, Nikolau BJ, and Ulrich 
TH. Plant Physiol 102:303-312, 1993], No gene sequence of the emb564 gene has been 
published and the exact localisation of its mRNA is unknown [Williams B and Tsang A, 
Plant Mol Biol, 16: 919-923, 1991]. 

The expression of sunflower gene Ha dslO Gl, as well as its promoter and 
regulatory sequences present unique characteristics among the other members of the 
LEA-I family, as described below, which means that these sequences may be potentially 
used for the modification of seeds by genetic engineering. 



DESCRIPTION OF THE INVENTION 

The present invention refers to the isolation and characterisation of the promoter 



and regulatory sequences of a sunflower LEA-I gene, Ha dslO Gl, in transgenic tobacco 
plants. These sequences (Example 1) present highly appropriate characteristics for their 
use in the modification of seeds (e.g. storage substances). The advantages of their 
possible use in transgenic plants are demonstrated in other examples: A.- Studies of HA 
dslO mRNA accumulation and localisation in the homologous system (Example 2). These 
studies demonstrate both the high expression levels reached during embryogenesis since 
early maturation phases, as well as the absolute seed specific localisation, accompanied of 
a homogenous distribution in embryos, which terminates essentially restricted to the 
cotyledon palisade parenchyma, a tissue specialised in the accumulation of sunflower 
storage substances. B.- In example 3 the possible use of such sequences, via the 
construction and analysis of various chimeric genes in transgenic plants, is illustrated, 
using the promoter and combinations of various Ha dslO Gl regulatory sequences (5- 
flanking, coding, intron and 3'-flanking), with the reporter gene of bacterial P- 
glucuronidase (GUS). These examples demonstrate the usefulness of the different 
chimeric genes tested in a heterologous model (tobacco): high expression level and seed 
specificity since early maturation phases, as well as the functional contribution of the 
various sequences tested. The examples attached demonstrate that the seed specificity is 
basically conferred by the promoter and the 5'-flanking sequences of Ha dslOGl 
(including untranscribed and transcribed sequences, such as the 5'-UTR, and part of the 
coding sequence). Additionally, the 3'-flanking sequences increase expression levels in 
seeds, and the intron specifically reduces the expression levels in non-embryonic tissues. 
Due to the conservation of the regulation of embryonic gene expression in plant seeds, 
including LEA-I genes [Thomas TL, in I 5:1401-1410, 1993]; these sequences could be 
used both in the homologous system (sunflower) as in other heterologous systems of great 
economic importance (for example oilseed rape, soybean, maize, etc). 



The practical embodiment of this invention, represented by the attached examples 
and figures, uses conventional Molecular Biology, Microbiology, recombinant DNA and 
transgenic plant production techniques that are common practice in laboratories 
specialised in these fields. These techniques have been explained in sufficient detail in the 
scientific literature [Sambrok J, Fritsch EF, and Maniatis T, Molecular Cloning: A 
Laboratory Manual Cold Spring Harbor laboratory Press, 2nd Edition, 1989; Glover 
DM, DNA Cloning, IRL Press , 1985; Lindsey K., Plant Tissue Culture Manual, Kluwer 
Academic Publishers, 1993; and Gelvin SB, Schilperoort RA, Verma DPS, Plant 
Molecular Biology Manual, Kluwer Academic Publishers, 1992]. For more details, other 
references are cited in the corresponding section in this application. 

EXAMPLE 1: Cloning, determination of restriction map, nucleotide sequence and 

analysis of the Ha dslO Gl promoter. 

To obtain the Ha dslO Gl clone, the sunflower genomic DNA gene library 

described by Coca et al [Plant MoL Biol. 31: 863-876, 1996] was screened, with the 

probe corresponding to total Ha dslO cDNA [Almoguera and Jordano, Plant Mol. Biol. 

19: 781-792, 1992]; using standard hybridisation conditions and molecular cloning 
procedures described in detail in the first reference of Coca et al, 1996. A phage 
aCEMll) with a sunflower genomic DNA insert of approximately 16.5 Kb - whose 
partial map is shown in Figure 1 - was isolated. Using restriction analysis it was 
determined that the two fragments adjacent to the Sac I site (4.2 and 9.3 Kb) contain the 
sequences that hybridise with the cDNA. A detailed restriction map of the first of these 
fragments was determined and part (4 Kb) of the second (Figure 1). Different genomic 
DNA subfragments, corresponding to the mapped region, were cloned in pBluescript SK+ 
vector, resulting in plasmids whose names and inserts are listed in Figure 1. The 3617 bp 



nucleotide sequence between the Sac I and Sma I sites (Figure 1, lower section) was 
determined from these plasmids on both DNA strands using the Sanger (dideoxy) method. 
These data are presented in SEQ No. 1. By comparing the sequences it was confirmed 
that part of the genomic sequence determined corresponds to Ha dslO cDNA [Almoguera 
and Jordano, Plant Mol. Biol. 19: 781-792, 1992, GenBank access number X59699]. The 
amino acid sequence of the protein coded by the Ha dslO Gl gene is indicated below the 
corresponding nucleotide sequences. In the genomic DNA, the coding region is 
interrupted by an abnormally long intron (1024 bp), even though it is situated in a 
conserved position in other class I LEA genes [see data reviewed by Simpson GC, Leader 
DJ, Brown JWS and Franklin T, in Characteristics of Plant pre-mRNA Introns and 
Transposable Elements, Plant Mol Biol. LabFax, pp 183-252; Croy RRD Ed., Bios 
Scientific Publishers Ltd. 1993]. The only difference between the gene sequences coding 
for mRNA and those of cDNA, was a two nucleotide inversion (GC instead of CG) within 
the second exon (in positions +1176 and +1177 from the initiation codon). Said inversion 
induces an amino acid change (S instead of T) in the protein sequence. The difference is 
due to an error (a compression) in the initial reading of the cDNA sequence reactions. The 
Ha dslO Gl sequences determined also include 1576 bp of the gene promoter and 5'- 
flanking region, as well as the 553 bp of 3'-flanking genomic regions which are not 
present in the original cDNA. 

Three possible transcription initiation sites were determined in the Ha dslO Gl 
promoter by the primer extension technique. Two of these sites have been confirmed with 
other techniques (sites 1 and 2, indicated by arrows in SEQ No. 1). For this purpose the 
procedure described by Domon et al. was used [Domon C, Evrard JL, Pillay DTN and 
Steinmetz A. Mol. Gen. Genet. 229:238-244, 1991], whereby the total sunflower embryo 
RNA was hybridised with the synthetic primer: 5'- 
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CrCCTGTTCCGGAATTTTGCGTGT-3* [SEQ ID NO: 2]. The sequence of said 
synthetic primer corresponds to that of the non coding strand of Ha dslO Gl, between 
positions +25 and +48, from the initiation codon. The hybridisations with the primer were 
carried out at 62°C. The hybrids were extended with AMV reverse transcriptase, for 90 
min at 42'*C. The extension products were analysed on 6% PAGE sequencing gels, along 
with sequence reactions produced using the same primer. Initiation sites 1 and 2 (at 
positions -33 and -25, see SEQ No. 1) are functional, and are detected independently 
using the ribonuclease A protection technique (RNAse A, see Figure 3A). A third 
initiation site (site 3, in position -119 in SEQ No. 1) could not clearly be confirmed with 
this technique. These initiation sites functionally define the 3' end of the Ha dslO Gl gene 
promoter. 

The analysis of the proximal sequences of the Ha dslO Gl gene promoter 
demonstrated that the two initiation sites detected (sites 1 and 2) are found at an 
appropriate distance from a possible TATA sequence (at position -86). The possible more 
distal site (site 3,-119) does not have clear TATA sequences in its proximity. Apart from 
these promoter elements, two possible RY "boxes" (RYl and RY2 at positions -129 and - 
65 of SEQ No. 1) were observed, analogous to those that participate in the regulation of 
the expression of numerous plant genes in seeds [Dickinson DC, Evan RP, and Nielsen 
RC, in Nucleic Acids Research 16: 371, 1988 ]. 

The RYl box sited at position -129 was modified by verifying through transient 
expression experiments in sunflower embryos, its functional requirement for the trans- 
activation of the Ha dslO Gl promoter by ABI3 type transcription factors [Giraudat J., 
Hauge BM, Valcn C, Smalle J, Parcy F, Goodman HM in The Plant Cell: 1251-1261, 
1992]. In order to do this, modifications of the dslO::GUS fusions constructed for 
transgenic plant studies were prepared (see Example 6.3 and Figure 5). The chimeric 



genes contained in these two fusions (dslOFl and dslOF2) were purified as DNA 
fragments, which were subcloned by hgation into pBluescript SK+ (Promega) vector, thus 
replacing the binary vector sequences by smaller ones. Smaller binary vector sequences 
are more useful for transient expression experiments. The plasmid pSKdslOFl was 
obtained by using the Sal I - Eco RI fragment (with the chimeric gene obtained from 
dslOFI). In the case of dslOF2, the Sph I - Eco RI fragment (from position -125 in Ha 
dslO Gl, to the 3' end of nos) was ligated to the complementary fragment (which contains 
the promoter and 5'-flanking sequences of Ha dslO Gl), purified after digestion of 
pSKdslOFl with Sph I and Eco RI, resulting in the pSKdslOF2 plasmid. Finally, from the 
pSKdslOFl and pSKdslOF2 plasmids (maps not shown) mutagenised versions were 
obtained by digestion of their DNA with Sph I, blunting the resulting ends by treatment 
with T4 DNA polymerase, followed by re-ligation of the DNA. Plasmids 
pSKdslOFlARY and pSKdslOF2ARY (maps not shown) were obtained. These plasmids 
only differ by a 5 nucleotide deletion between positions -126 and -122 of the Ha dslO Gl 
promoter. These changes destroyed the RYl box present in the dslOFI and dslOF2 
chimeric genes (see Figures 1, 2 and 5), this was verified by the Sanger (dideoxy) method 
sequencing reactions, using the primer 5'CTCCTGTTCCGGAATTTTGCGTGT3' [SEQ 
ID NO: 2] (non coding strand of Ha dslOGl between positions +25 and +48). 

The trans-activation experiments in transient expression were carried out by 
bombarding sunflower embryos with projectiles coated with DNA mixtures from 
different plasmids. These mixtures contain the reference plasmid pD0432 [Ow DW, 
Wood KV, deLuca M, de Wet JR, Helinski D and Howell SH. Science 234: 856-859, 
1996], with the firefly {Photinus pyralis) luciferase (LUC) gene regulated by the CaMV 
35S promoter, the fusion of dslO::GUS tested in each case (with intact or modified RYl 
sequences), and an effector plasmid, pABI3, which expresses the ABI3 factor under 
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control of the CaMV 35S promoter. pABD was obtained by substituting the Pv ALF 
cDNA from the pALF plasmid [Bobb AJ, Eiben HG, an Bustos MM in The Plant Journal 
8: 331--343, 1995], with ABB cDNA. The ABB cDNA was cloned as an Xba I fragment 
(blunted with Klenow enzyme) - Eco RI (partial), fragment purified from the pcabi3-4F 
plasmid [Giraudat J., Hauge BM, Valon C, Smalle J, Parcy F, Goodman HM in The Plant 
Cell 4: 1251-1261, 1992]. pABB plasmid is added to, or omitted from, the mixture to test 
the effect of the ABB factor on GUS expression in the fusion tested. The experiments 
were essentially carried out as described by Bobb et al, [Bobb AJ, Eiben HG, and Bustos 
MM in The Plant Journal 8: 331-343, 1995], with the following modifications: 
Sunflower embryos (17-20 dpa) were prepared as follows: Sunflower seeds were 
sterilised by washing in 70% ethanol for 1 min, and in 2% sodium hypochlorite with a 
drop of Triton X-100 for 40 min, finally rinsed several times with distilled water, and then 
peeled under sterile conditions. The embryos are cut longitudinally (separating the two 
cotyledons) and placed, with the cut surface down on MS solid medium plates, containing 
2% sucrose and 0.5 M sorbitol. They are then pre-cultured for 2-4 h in the dark, at room 
temperature (25°C). All the plasmids were purified using the Quantum midiprep kit 
(Biorad). Normally for each bombardment 0.2 ^g of reference plasmid, 0.1 |ig dslO::GUS 
plasmid, and 1 ng of effector plasmid (or the same amount of pjrr82 plasmid in the 
negative controls) were used. For the preparation of the gold particles, as well as the DNA 
precipitation onto them, the method described by Chem et al. was followed [Chem MS, 
Bobb AJ and Bustos M. The Plant Cell 8: 305-321, 1996]. The particle bombardment was 
carried out using the Biolistic PDS-1000 He system (Biorad). The bombardment 
conditions were the following: 1550 psi rupture membrane, 1.6 ^m diameter gold 
particles, distance from rupture membrane to macrocarrier was 8 mm, distance from 
macrocarrier to grid was 6 mm, and the distance to the tissue to be bombarded was 6 cm. 
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The bombarded cotyledons were incubated for 24 h at 28 "^C in the dark, after 
bombardment, the GUS activity (relative to LUC activity) was tested as described by 
Bobb et al [Bobb AJ, Eiben HG, and Bustos MM in The Plant Journal 8: 331-343, 
1995]. 

The addition of pABI3 effector plasmid had a clear effect on the relative 
expression of GUS/LUC in bombardments with the pSKdslOF2 fusion (average increase 
in relative activity - 46.2X). On the other hand, if the trans-activation was carried out 
with the same plasmid with a mutation in the RY box (pSKdslOF2ARYl), a significant 
reduction in the average increase in relative activity, due to the ABI3 effect (== 26.3X), 
was observed. This result, shown in figure 2, confirms the functional requirement of the 
RYl sequence (position -129 in SEQ No. 1). Therefore, this RY box participates in the 
transcriptional activation in seeds of the Ha dslO Gl promoter for ABI3 type factors 
[Giraudat J., Hauge BM, Valon C, Smalle J, Parcy F, Goodman HM in The Plant Cell 4: 
1251-1261, 1992]. Other promoter sequences (e.g. RY2 in -65) could also contribute to 
the transactivation effect observed, since the mutation tested does not completely destroy 
the activator effect of ABI3. 

EXAMPLE 2: Accu mulation and specific localisation of Ha dslO mRNA in sunflower 
embrvos: 

The messenger RNA accumulation patterns of the Ha dslOGl gene were 
determined by the Ribonuclease A (RNAse) protection technique, described in detail by 
Almoguera et al [Almoguera C, Coca MA, Jordano J. Plant Physiol 107: 765-773, 
1995]. To do this, total RNA samples prepared from seed embryos at different stages of 
development, under normal growth conditions, were used [Almoguera and Jordano, Plant 
Mol Biol 19: 781-792, 1992; Coca et al. Plant Mol Biol 25: 479-492, 1994]; from 3- 
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day seedlings after imbibition (dpi); and from different adult plant organs before 
flowering. The seedling and plant RNA were prepared from plant material obtained both 
under controlled growth conditions [Almoguera and Jordano, Plant Mol Biol 19: 781- 
792, 1992; Coca MA, Almoguera C, and Jordano J. Plant Mol Biol 25: 479-492, 1994; 
Coca MA, Almoguera C, Thomas TL, and Jordano J. Plant Mol Biol 31: 863-876, 
1996], and after stress treatments. Said stress treatments can be water deficit [Almoguera 
C, Coca MA, and Jordano J. Plant J, 4: 947-958, 1993; Coca MA, Almoguera C, Thomas 
TL, and Jordano J. Plant Mol Biol 31:863-876, 1996]; or after addition of hormones 
such as absicic acid [Almoguera C and Jordano J. Plant Mol Biol 19: 781-792, 1992; 
Coca MA, Almoguera C, Thomas TL, and Jordano J. Plant Mol Biol 31: 863-876, 
1996]. The conditions used in each treatment are described in detail in the references cited 
for each case. The riboprobe used to detect the Ha dslO Gl mRNA is 396 nucleotide 
long. 63 of these 396 nucleotides are sequences of the pBluescript SK+ vector. The 
remaining nucleotides are the sequence of the non coding strand of Ha dslO Gl between 
positions +212 and -121 (Sph I). This hybrid probe with the 5' end of Ha dslO Gl 
messenger RNAs, exceeding the more distal transcription initiation site (site 3, SEQ No. 
1), allows the detection of messenger RNA (mRNA) produced from the three initiation 
sites and the experimental verification of the initiation positions. This riboprobe was 
prepared by in vitro transcription, using RNA polymerase T3 and dslOGlS3A4.4 plasmid 
DNA (Figure 1) as a template, which contains the Ha dslOGl sequences between -1576 
(Sal I) and +212 cloned in the pBluescript SK+ vector.. 

Figure 3 shows that the Ha dslO Gl messenger RNAs are only detected in seeds. 
Higher accumulation levels are observed around 18-20 dpa. Gene expression is detected 
from 10 dpa and it disappears after germination (Figure 3). Treatments with ABA, or 
water deficit did not induce the accumulation of Ha dslO Gl messenger RNAs (data 
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shown for ABA in seedlings, Figure 3). As a positive control in the RNA samples tested 
for the different treatments, hybridisations (data not shown) with another previously 
described 651 nucleotide riboprobe of Ha hspl7J G4 gene were carried out [Coca et al. 
Plant MoL Biol 31: 863-876, 1996]; since this gene is expressed in response to the 
different treatments tested. These analysis showed that the Ha dslO Gl mRNAs were 
only 'accumulated in seeds, under normal growth conditions and since early stages of 
maturation, confirming the initiation from at least sites 1 and 2 (indicated in SEQ No. 1). 
The band marked with number 3 (Figure 3) does not coincide well with the expected size 
for initiation site 3 (SEQ No. 1). This band could be due to the protection of messenger 
RNA sequences of a highly homologous gene, or even Ha dslO Gl itself, containing 
intron sequences (unprocessed mRNA). 

The distribution of Ha dslO Gl mRNAs in sunflower embryos was investigated 
by in situ hybridisation localisation experiments. In order to do this, embryos were 
embedded in paraffin, fixed, sectioned and hybridised with specific probes; essentially as 
described by Molinier [in the thesis: Diplome d'Etudes Approfondies de Biologic 
Cellulaire et Moleculaire, Universite Louis Pasteur, Strasbourg, 1995]. The fixing time 
was increased from 16 h at 4*'C to 5 days, the increase depending on the age of the 
embryos. The dehydration of the fixed embryos was carried out by successive incubations 
(2 times each for 30-90 min.) in 10%, 20%, 30%, 40%, 60%, 70%, 95% and 100% 
ethanol; followed by immersion in 100% toluene (l-3h, 2 times). The fixed embryos were 
first embedded in toluene : paraffin (1:1), at 65'*C for 6-15 h, followed by 5 consecutive 
inclusions in paraffin, at 60°C for 5-15 h. The pre-hybridisations and hybridisations with 
the probes were carried out at 45''C. The specific Ha dslO Gl riboprobe, corresponding to 
the mRNA 3' end was prepared as follows: The dslOGlSl plasmid (Figure 1) was used as 
a template to prepare two in vitro transcription probes [Almoguera C, Coca MA and 
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Jordano J. Plant Physiol 107: 765-773, 1995] marked with DIG-UTP, The dslO-3X-) is 
obtained by digesting plasmid DNA with Pvu n and carrying out the transcription with 
RNA polymerase T3. This probe corresponds to the non-coding strand of Ha dslO Gl 
between positions +1202 (Pvu H in the second exon) and +1592 (3' end). The second 
probe [dslO-3' (+), used as a control], was prepared digesting Ha dslO GlSl DNA with 
Bam HI (in the polylinker)\ and carrying out the transcription with RNA polymerase T7. 
Probe dslO-3'(+) contains the coding chain of Ha dslO Gl, between position +870 and 
+1592. The specificity of the hybridisation was determined by Southern blot experiments 
similar to those described by Almoguera and Jordano [Plant Mol Biol 19: 781-792, 
1992]. While the hybridisation with a total cDNA probe detects bands corresponding to 
some 4-5 different genes in the sunflower genome [Almoguera C, and Jordano J. Plant 
Mol Biol 19: 781-792, 1992], a single gene (with a slight cross hybridisation with 
another one, data not shown) was detected using probe dsl0-3'(-). 

The results obtained in the RNA localisation experiments are shown in Figure 4. 
Probe dsl0-3'(-) is complementary to Ha dslO Gl mRNA, and has opposite polarity to 
Ha dslO Gl mRNA, what allows its detection. The results obtained agree with the 
protection data shown in Figure 3, and demonstrate its accumulation in embryos from 12- 
15 dpa (Figure 4A) to 21-28 dpa (Figures 4C, F and H). This accumulation takes place to 
high levels, which can be deduced from the short time required for its histochemical 
detection (2-4 hours). In immature embryos (Figure 4A) the distribution of Ha dslO Gl 
mRNA is homogeneous and comparable (Figure 4B) to that of 18S rRNA, which is 
detected using another riboprobe corresponding to fragment G (Eco RI) of the radish 18S 
gene [described by Delcasso-Tremousaygue D, Grellet F, Panabieres F, Ananiev E D, and 
Delseny, M. in Eur, J. Biochem, 111: 767-776, 1988]. In more mature embryos (21 dpa, 
Figure 4C) the Ha dslO Gl mRNA are also localised fairly homogeneously, with a more 
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intense accumulation detected in the vascular bundles iprocambium), something which is 
neither observed with the 18S rRNA probe nor in this or other development stages 
(Figures 4D, B and G), Finally, at 28 dpa the Ha dslO Gl mRNA are preferably localised 
in the palisade parenchyma, a tissue specialised in the accumulation of storage 
substances, located in the internal face of cotyledons (Figures 4F and H). The 
localisations with probe dslO-3' (+), with the same polarity as the Ha dslO Gl mRNA, 
did not give any hybridisation signal, which was a control for the previously described 
experiments (compare Figures 4C and E). These experiments demonstrated that the Ha 
dslO Gl mRNA expression patterns in sunflower are very special. The expression 
observed in seeds, with high levels of accumulation since early embryonic maturation 
stages (10-12dpa), are combined with spatial distributions which change from 
homogeneity to a greater abundance in storage substance deposit tissues (palisade 
parenchyma). The distribution and accumulation pattern of Ha dslO Gl mRNA is 
different from that presented by other plant genes belonging to the same family [Wurtele 
ES, Wang HQ, Durgerian S, Nikolau BJ and Ulrich TH. Plant Physiol 102: 303-312, 
1993; Gaubier, P., Raynal, M., Hull, G., Huestis, GM., Grellet, F., Arenas, C, Pages, M., 
and Delseny, M., Mol Gen. Genet., 238: 409-418, 1993]. These results indicate the 
potential usefulness of chimeric genes that incorporate Ha dslO Gl regulatory sequences 
for the modification of seeds by genetic engineering. 

EXAMPLE 3: Construction of dslOGl::GUS chimeric genes and their analvsis in tobacco 
transgenic plants:. 

As an example of the possible uses of the promoter and the regulatory sequences 
of Ha dslO Gl gene in the construction of chimeric genes with specific expression in 
transgenic plant seeds, the construction and analysis of 4 dslOGl::GUS translational 
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fusions in tobacco transgenic plants (Figure 5) are described below. These fusions contain 
the promoter and different combinations of flanking and intragenic sequences of Ha dslO 
Gl gene for its functional analysis. These 4 fusions provide high levels of expression of 
the reporter gene (GUS) in seeds from early maturation stages (Figure 6), confirming the 
observations in the homologous system (Example 2, Figures 1-4). 

The first of these constructions, dslOFl (Figure 5) was obtained from the 
dslOGlS3 plasmid (Figure 1), which contains the genomic sequences of Ha dslO Gl 
between Sal I (-1576) and Eco RI (+1086), subcloned into the corresponding restriction 
sites of the pBluescript SK+ vector (Promega). The Ha dslO Gl sequences between Eco 
RI (+1086) and position +98 (in the first exon) were deleted by treating with Exonuclease 
m of the dslOGlS3 DNA (previously digested with Hind m and Pst I) Said deletion 
results in the dslOGlS3A10.5 plasmid (Figure 1). This plasmid was digested with Bam HI 
ipolylinker restriction target situated inmiediately adjacent to position +98 of Ha dslO 
Gl), and then, filling in the digested DNA ends using the Klenow fragment of DNA 
polymerase I. The DNA was then digested with Sal I, and the 1679 bp fragment 
containing the Ha dslO Gl sequences between Sal I (-1576) and the filled end of Bam HI 
was purified. This fragment was cloned between the Sal I and Sma I sites of the pBI 101.2 
binary vector, resulting in dslOFl. dslOFI is a translational fusion which contains 1576 
nucleotides of 5*-flanking Ha dslO Gl sequences (from ATG) and the first 98 nucleotides 
of the coding gene, in phase with the GUS gene (Figure 5). The dslOF2 fusion was 
derived from dslOFl, through the insertion of a genomic DNA fragment of Ha dslOGl 
comprised between positions (Figure 1) +1205 (Pvu II) and Eco RI (=+4670) . This 
fragment contains part of the second exon and =3370 nucleotides of 3*-flanking sequences 
(from the termination codon in position +1301); and replaces the nos-3* sequences in the 
dslOFl fusion. The Pvu 11- Eco RI insert was purified from dslOGlS2 plasmid DNA. For 
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the insertion of this fragment, the dslOFl DNA was digested with Sac I, and the DNA 
ends were blunted by treating with T4 DNA polymerase L Then, the DNA thus treated 
was digested with Eco RI, and the fragment including the Ha dslOGl sequences was 
purified. This fragment was ligated to the previously described Pvu H- Eco RI insert (with 
the Ha dslO Gl 3'-flanking sequences), resulting in the dslOF2 fusion (Figure 4). The 
dslOF2A fusion (Figure 4) was obtained from dslOF2, through the deletion of the Ha 
dslOGI 3'-flanking sequences between Xba I (==+2830) and Eco RI (==44670). To do this, 
dslOF2 DNA was digested with both enzymes, religating, after blunting the resulting 
DNA ends with the Klenow fragment of DNA polymerase I. Finally, the fourth fusion 
(dslOF3, Figure 5) was obtained from a Ha dslO Gl genomic DNA fragment between Sal 
I (-1576) and Pvu H (+1204), purified from dslOGlS6 plasmid (Figure 1), after digestion 
with both restriction enzymes. This fragment was ligated with vector pBIlOl.3 vector, 
previously digested with Sal I and Sma I. The dslOF3 fusion thus contains the promoter 
and the same 5*-flanking sequences of Ha dslO Gl present in dslOFI fusion, as well as 
the first exon (From +1 to +145), the total intron (from +146 to +1169) and part of the 
second exon of Ha dslO Gl (from +1170 to +1204), fused in phase with the pBI 101.3 
GUS gene. In all cases the nucleotide sequence corresponding to the fusion zone, between 
the GUS and the Ha dslO Gl sequences, was tested by sequencing reactions with the 
Sanger (dideoxy) method, using GUS sequences as the primer: 5- 
ACGCGCTTTCCCACCAACGCTG-3' [SEQ ID NO: 3]. 

The T-DNA in dslOFI, dsl0F2, dslOF2A and dsl0F3 fusions (Figure 5) was 
mobilised from A. tumefaciens (LBA 4404), obtaining different tobacco transgenic plants 
with independent integrations of each chimeric gene. These plants were obtained and 
characterised by standard techniques as described in detail by Coca MA, Almoguera C, 
Thomas TL and Jordano J, [in Plant Molecular Biology , 31: 863-876, 1996]. The 
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expression of GUS gene was analysed both in developing seeds and under normal growth 
conditions (without exogenous stress); as in seedling tissues, in the latter case the 
expression changes induced by ABA and dehydration treatments were studied. The seed 
analysis were carried out with the original transgenic plants (TO), while for the seed 
analysis of the seedlings descendants of these plants (Tl) were used, which are 
segregating for the chimeric genes. Quantitative studies by fluorometric analysis of GUS 
expression levels and their temporal patterns, as well as qualitative studies which 
analysed histochemically the spatial patterns of expression (tissue specificity) were 
carried out. These studies were carried out as described in detail by Coca MA, Almoguera 
C, Thomas TL and Jordano J, [in Plant Molecular Biology, 31: 863-876, 1996]. In total, 
the following number of tobacco transgenic plants, TO "functional", containing the 
chimeric genes were obtained and analysed: containing dslOFl: 14 plants, containing 
dslOF2: 7 plants, containing dslOF2A: 8 plants, and containing F3: 23 plants. These 
plants showed high levels of GUS gene expression in seeds (as a result of the activity of 
the Ha dslO Gl gene promoter and regulatory sequences), as illustrated in Figure 6 
(panels A-C). The integration of the different chimeric genes in the DNA of the 
transgenic plants was characterised by Southern analysis using probes for the coding 
GUS gene region; PGR amplifications of the sequences close to the dslO::GUS splice, 
using the 5'-ACGCGCTTTCCCACCAACGCTG-3' [SEQ ID NO: 3] (GUS) and 5'- 
GAGTGAACAgAATtcCATCACAACAGGG-3' [SEQ ID NO: 4] (dslOEco RI) primers; 
or by the Kanamycin resistance segregation test (conferred by the nptll gene), performed 
as described in Jordano J, Almoguera C, and Thomas TL, The Plant Cell 1: 855-866, 
1989. These analysis determined that the TO plant selected for the seed expression studies 
contained 1 to 5. integrations independent on the corresponding chimeric gene. Figure 6 
illustrates the more relevant results obtained in the study of the expression of the chimeric 
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genes analysed in transgenic plants. These results are described in detail below. 

GUS expression during seed maturation under controlled growth conditions 
(without exogenous stress), was analysed by fluorimetric (Figure 6A) and histochemical 
(summary in Figures 6B-E) assays. The fluorimetric assays were carried out in seeds at 
defined maturation stages, 12, 16, 20, 24 and 28 days post-anthesis (dpa). For each TO 
plant and maturation stage, two different floral capsule extracts were prepared, and the 
GUS activity was assayed with Methylumbelliferylglucuronide (MUG) in duplicate (in 
total four activity determinations per development stage and per individual transgenic 
plant). The statistical significance of the differences observed with the different GUS 
fusions was determined, after logarithmic normalisation of the data obtained, by variance 
analysis [ANOVA, see: Nap JP, Keizer P, and Jansen R, in Plant Molecular Biology 
Reporter 11: 156-164, 1993]. The histochemical assays were carried out with material 
dissected from seeds, at defined development stages, from the following number of 
transgenic plants: dlOFl, 5, dslOF2, 6, dslOF2A, 6 and dsF3, 19. The endosperm and the 
embryos dissected from individual seeds were stained with X-gluc, for 150 min at 25°C, 
approximately 150 seeds from each transgenic plant were analysed in this manner. 

All chimeric genes produced high levels of GUS expression in seeds, reaching 
average maximum values of 1.65 x 10^ pmol MU/ mg x min (Figure 6A: at 24 dpa). The 
histochemical assays confirmed these high activity values, since both the embryos 
(Figures 6B and C) and the endosperm (Figure 6C) were strongly stained from 12 dpa 
(Figure 6B) and with only 150 min of reaction. In both cases fairly homogeneous spatial 
distributions of the GUS activity were observed (Figure 6B-C). Furthermore, these 
expression patterns do not differ qualitatively between the different chimeric gene 
transgenic plants (data not shown). 

The fluorimetric assays revealed interesting quantitative differences between the 
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different dslO::GUS fusions. These differences depend on the Ha dslO Gl sequences 
present in the fusions. In some cases the statistical significance of these differences could 
be demonstrated (with a confidence level of 95%), which experimentally demonstrates 
the contribution of the different sequences tested (promoter and 5'-flanking sequences, 
coding sequences, 3'-flanking and intron) to the embryonic expression patterns observed. 
The presence of Ha dslO Gl 3'-flanking sequences in the fusions increases the GUS 
expression levels in seeds between 20 and 28 dpa (compare fusions dslOF2 and dslOF2A, 
with dslOFl in Figures 5 and 6A). This difference is statistically significant (for example 
at 28 dpa: F = 5.397, P: 0.0213), and is caused by the Ha dslO Gl sequences present in 
the dslOF2A fusion (see Figure 5); since no significant differences were found between 
the GUS activity of dslOF2 and dslOF2A (for example, also at 28 dpa, F=:0.274, 
P=0,6015; see Figure 6A). In the case of dslOF2A, the stimulating effect of the 3'- 
flanking sequences also occurs and is highly significant, in earlier embryonic maturation 
development stages (Figure 6A, 16 dpa; F=16.607, P=0.001), On the other hand, in these 
stages (between 12 and 16 dpa) dslOFI and dsl0F2 GUS activities do not differ 
significantly (e.g. at 16 dpa: F=2.762, P=0.0983; see Figure 6A). Overall these results 
show that dsl0F2A is the constructed and tested fusion that works the best in tobacco 
seeds from 16dpa; and that this is due to the effect of Ha dslO Gl 3'-flanking sequences 
included in it. It is not know whether this effect is caused by transcriptional activation, or 
caused by mRNA stabilisation mechanisms, or by a combination of both. In any case the 
effect is clear and of potential usefulness to design new chimeric genes with more 
efficient expression in seeds, since relatively early embryonic maturation stages (see also 
the section "Other Examples"). 

On the other hand, the comparison of the GUS activities in plants with the dslOFI 
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and dslOFS fusions allowed to investigate the possible effects of the presence of the 
intron (and/or Ha dslO Gl coding sequences in which these fusions differ. Figure 5) on 
the expression of both fusions. In transgenic tobacco seeds these comparisons 
demonstrated that the presence of the intron (plus the first total exon and part of the 
second exon) does not have positive effects on GUS expression, which must therefore be 
essentially conferred by the Ha dslO Gl promoter and the sequences present in dslOFl 
(Figure 6A). Thus for example, the activities of dslOFI and dslOF3 are not statistically 
different between 12 and 28 dpa, except at 20 dpa (F=4.73, P=0.031) and then the 
presence of additional sequences in dslOFB significantly reduced the GUS activity 
observed. Therefore, even though it is highly probable that the intron is correctly 
processed in the seeds of heterologous systems such as tobacco (there is not any formal 
proof), its possible regulatory role in embryonic development is unclear. However other 
observations do not exclude that the additional Ha dslO Gl sequences in dslOFS 
(including the intron) may have regulatory roles in other tissues (see below the effect of 
these sequences on residual expression of dslO::GUS fusions in pollen and seedlings). 

Embryonic specificity (to seeds) of GUS expression conferred by the Ha dslO Gl 
sequences in tobacco transgenic plants was verified through investigations in other 
tissues; both in the absence of stress as well as after dehydration and ABA treatments. In 
the case of TO plants, the only tissue wherein GUS activity was detected by fluorimetric 
and histochemical assays, was mature pollen. In other tissues the activities detected barely 
exceeded background levels (non-transformed tobacco plants). For example, in TO plant 
leaves of about two months of age: 0-50 pmol MU/ mg x min. The activities detected in 
pollen are marginal (almost three orders of magnitude less) when compared with those of 
seeds from the same transgenic plants. Furthermore, this expression could be an artefact 
and depend on the use of GUS gene as an indicator in the fusions [according to Uknes S, 
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Dincher S, Friedrich L, Negrotto D. Williams S, Thompson-Taylor H, Potter S, Ward E, 
and Ryals J, in the Plant Cell 5: 159-169, 1993]. However, it was surprisingly observed 
that the activity measured in the pollen of the 9 dslOF3 plants was (136 ±64 pmol MU/ 
mg X min) significantly less than that of the 5 dslOFl plants (6427 ±1294 pmol MU/ mg x 
min; F= 72.573, P= 0.0001). The latter could indicate that, unlike what is observed in 
seeds during most of their embryonic development (Figure 6A), the presence of the 
additional Ha dslO Gl sequences in dslOF3 (including the intron) may reduce the 
expression of the chimeric genes containing them in other tissues or stages of 
development. 

The possibility of expression of the dslO::GUS fusions being induced by 
hormones (ABA) or stress treatments (water deficit) in tobacco transgenic plants (Tl) at 
different times in its vegetative cycle was also checked. In order to do this, descendants of 
8 different original plants were selected, after germination in MS medium with 300 |ag/ml 
kanamycin, containing dslOFI, dslOF2A and dslOF3; and another 6 with dslOF2. The 
resistant seedlings were transplanted in MS medium. Various experiments were carried 
out with seedlings, both at 8 and 15 days after imbibition. For the ABA treatments, the 
seedlings were transplanted in MS plates supplemented with 100' fxM ABA and cultivated 
in this medium for 4 days at 25°C in light. The control seedlings were also transplanted in 
MS medium without ABA. Water stress was induced by placing the seedlings for about 5- 
6 hours in a laminar flow hood between two filter papers. After the different treatments, 
the seedlings were processed either individually (for the histochemical assays with X- 
gluc, by 14 h incubations at 25 °C); or jointly (pool analysis), for the GUS activity 
fluorimetric assays as described previously. The adult transgenic plant treatments, were 
carried out using individual plants propagated as vegetative clones obtained from each 
original plant. To do this, the seedhngs selected from each transgenic plant were 
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transplanted to venniculite imbibed with Hoagland 0.5X medium. From each seedHng 
three complete explants were obtained, which were placed in hydroponic culture, after 
recovery, in liquid Hoagland medium (0.5X). The experiments were carried out when the 
plants had completely recovered from the propagation process, and had roots, stem and 
about 10-12 leaves. Therefore, genetically identical plants from each selected transgenic 
seedling were used for the different treatments. The ABA treatments were carried out by 
adding the hormone to the medium (100 |xM) and analysing the GUS activity in the plants 
after 24h. Water stress was induced by removing the root from the container with the 
medium, also analysing the plants 24h after starting the treatment. The effect of the 
different treatments was assessed in three independent experiments performed with the 
following number of Tl plants for each fusion (the number of TO plants from which they 
proceed in each case is given in parenthesis): dslOFl, 11 (6); dslOF2, 10 (5); dslOF2A, 5 
(3); and dslOF3, 10(5). 

The experiments carried out both in seedlings and in adult plants which confirmed 
the embryonic specificity of the expression conferred by the Ha dslO Gl sequences to the 
different fusions, also provided additional clues to the possible regulatory role of the Ha 
dslO Gl sequences present in dslOF3 (including the intron) previously mentioned. Thus, 
both in control adult plants as in treated plants, minimum GUS activities (from 3 to 300 
pmol MU/ mg x min) were detected in all the tissues analysed (roots, stem, leaves and 
apical meristem). These activity levels are only slightly above the background levels and 
can only be detected fluorimetrically (data not shown). 

In 8 dpi seedlings, the expression of all the fusions is about two order of 
magnitudes lower than the maximum levels reached in seeds. This expression rapidly 
decreases between 8 and 15 dpi (e.g. dslOFI goes from 2864 ±182 to 813±104 pmol MU/ 
mg X min); and is exclusively restricted to embryonic tissue (cotyledons), without being 
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detected in other vegetative tissues (radicle, hypocotyl, leaves) differentiated after 
germination (Figures 6D and E, and data not shown for the other fusions). These results 
confirm the embryonic specificity of the regulation by Ha dslO Gl sequences in 
transgenic tobacco plants. Apart from the general reduction in GUS activity values 
mentioned previously, differences between the values of the different fusions, some of 
them statistically significant, were observed. These differences were qualitatively similar 
to those observed in seeds (Figure 6A). Among them, and due to their possible applied 
interest, the reduction of expression after germination, mediated by the Ha dslO Gl 
sequences present in dslOF3 (including the intron) is illustrated. This effect is observed as 
a significant reduction of GUS activity when the dslOFl and dslOF3 plant expression 
patterns are compared (Figures 6D and E). The statistical analysis of the quantitative 
dslOFI and dslOF3 data confirmed the significance of this difference, both at 8 dpi (F= 
4.36, P= 0.04) and at 15 dpi (F= 4.39, P= 0.039). Additionally, a moderate induction of 
GUS by ABA treatment in dslOFI seedlings was observed, which is statistically 
significant (from 2864 ±182 to 5790 ±733 pmol MU/ mg x min; F= 5.413, P= 0.023). In 
the case of dslOF3 there was no significant induction by the same treatment (from 1502 
±195 to 2338 ±211 pmol MU/ mg x min; F= 2.58, P= 0.11). The different treatments did 
not substantially affect the tissue specificity, or the order of magnitude of the expression 
observed for the different dslO::GUS fusions (data not shown). 

OTHER EXAMPLES: 

Other chimeric genes can be obtained, in an analogous manner to that of the one 
described in detail in the previous example, said chimeric genes containing 5'-flanking, 
and(or) 3 -flanking (terminators), and(or) coding sequences from Ha dslO Gl, combined 
with sequences from other genes. These examples do not involve any additional technical 
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complications to those described in more detail in the previous sections, for which reason 
they can easily be carried out by persons with sufficient knowledge in the field of the 
invention technique. Thus for example, in dslO::GUS fusions the Ha dslO Gl could have 
included other longer 5'-flanking (Figure 1) sequences of the same gene to increase its 
expression level in seeds as described in Coca MA, Almoguera C, Thomas TL, and 
Jordano J, in Plant Molecular Biology ,31: 863-876, 1996. Equally, the GUS sequences 
could be substituted by others, coding for different proteins or peptides (natural or 
artificial), whose regulated production in plant seeds could be of industrial interest. 
Examples of these last possibilities, non exclusively, would be the fusion with Ha dslO 
Gl sequences of coding sequences of genes involved in fatty acid biosynthesis in seeds 
[Voelker TA, Worrell AC, Anderson L, Bleibaum J, Fan C, Hawkins DJ, Radke SE and 
Davies HM, in Science, 257:72-74, 1992], of storage proteins with compositions rich in 
specific amino acids [Saalbach I, Pickardt T, Machemehl F, Saalbach G, Schieder O, and 
Muntz K, in Molecular and General Genetics 242: 226-236, 1994], or peptides with 
antigenic or pharmacological activities [Vandekerckhove J, Van Damme J, Van 
Lijsebettens M, Botterman J, De Block M, Vandewiele M, De Clercq, Leemans J Van 
Montagu, M and Krebbers E, in BioTechnology 7: 929-932, 1989]. These fusions would 
be carried out and used in an analogous manner to what is described in the publications 
cited as an example (not exclusive) in each case. To facilitate these possibilities, a 
plasmid (dslOECl) was created, that contains an expression cassette including the 
promoter and the 5'- and 3'-flanking sequences of Ha dslO Gl present in dslOF2A (see 
Figure 5). Between both sequences and through directed mutagenesis [Chen E and 
Przybila AE, in BioTechniques 17: 657-659, 1994] an Eco RI restriction site was added. 
The addition of said restriction site allows the insertion of gene sequences, or of the 
corresponding peptide sequences, as previously mentioned (available in other 



26 

laboratories, or that could be designed or synthesised). The dslOECl plasmid was 
constructed from dslOGlS3A10.5 (Figure 1). From this plasmid, the Ha dslO Gl 
sequences between positions -1574 (Sal I) and +98 by PGR were amplified, by using 
DNA polymerase Pfu and the primers 5-ATTAACCCTCACTAAAG-3' [SEQ ID NO: 5] 
(T3) and 5 -GAGTGAACAgAATtcCATCACAACAGGG-3' [SEQ ID NO: 4] (dslOEco 
RI). In the latter the three sequence changes (indicated in lower case letters) introduce the 
new Eco RI site in the position of the initiation codon. After PGR, a 199 pb (megaprimer) 
DNA fragment is purified, which along with the 5'-AATACGACTCACTATAG-3' [SEQ 
ID NO: 6] (T7) primer is used for a second PGR amplification of dslOGlS3A10.5. The 
amplified DNA (795 pb) was digested with Eco RI and Sph I. The resulting DNA 
fragment (125 pb), with the Ha dslO Gl sequences between Sph I (-126) and the new Eco 
RI site, was purified and ligated by replacing in dslOGlS3 the Ha dslO Gl (Figure 1) 
sequences between positions -126 (Sph I) and 1086 (Eco RI). After this step, the PGR 
amplified sequence was verified by means of sequencing (Sanger's method) using the T3 
primer. Finally, an Ha dslO Gl genomic DNA fragment (Figure 1) was inserted in the 
plasmid obtained in the previous step, with sequences between +1086 (Eco RI) and 
==+3000 (Xba I), obtaining the dslOECl cassette (Figure 4), cloned in the pBluescript 
SK+ plasmid. The 3' end of dslOEGl DNA differs from that of dslOF2A only by 119 
additional nucleotides, corresponding to the intron and second ex on sequences of Ha dslO 
GL Furthermore, the Ha dslO Gl sequences in dslOECl differ from the corresponding 
ones in dslOF2A in the absence of nucleotides 1-98 of the first exon (Figure 5). 

Given that the presence of additional Ha dslO Gl sequences in dslOF3 (including 
the intron, the first exon and part of the second exon) reduced the expression of this 
chimeric gene specifically in non embryonic tissues (Example 3, Figures 6D-E), it is 
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conceivable that such sequences may be used to confer seed specificity to other chimeric 
genes with different promoters. The design of such chimeric genes does not involve 
additional technical difficulties other than those described in the previous sections: see for 
example the detailed procedures on the use of plant introns to prevent the expression of 
chimeric genes in Agrobacterium [Mankin SL, Allen GC and Thompson WF. Plant 
Molecular Biology Reporter \5: 186-196, 1997]. 

The chimeric genes containing the Ha dslOGl regulatory sequences could be 
transformed to other plants different from tobacco (the model system used in example 3). 
Among these, there are plants with major economical interest such as: sunflower, 
soybean, oilseed rape, "canola", maize, wheat, barley, rice, cassava, bean, peanuts, etc. 
whose genetic transformation is possible and has been sufficiently documented in the 
scientific literature: see for example Lindsey K, Ed. (1993). [Plant Tissue Culture 
Manual. Kluwer Academic Publishers]; and the review by Christou [Trends in Plant 
Science. 1: 423- 431, 1996]. The results shown in example 3 demonstrate that, in tobacco, 
the genes constructed with the Ha dslO Gl regulatory sequences have a high activity 
since relatively early embryonic maturation stages, and also maintain the seed specificity 
characteristic of Ha dslO Gl in sunflower. These results could also be obtained with other 
plants, such as those previously mentioned. 

DESCRIPTION OF THE FIGURES: 

Figure 1. Upper section: restriction map of the Ha dslO Gl genomic sequences 
that flank its coding region. The continuous lines on the map indicate the different 
genomic DNA fragments subcloned in pBluescript SK+ vector (the names of the 
respective fragments are indicated over each fragment). The plasmids prepared by Exo m 
deletions are indicated over the original plasmid (dslOGlS3ASacI), indicating in each 
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case the deletion end. On the lower section of the figure a detailed restriction map of the 
region whose nucleotide sequence was determined, is shown. The extension of the 
different reactions used to assemble the different sequences of both DNA strands, are 
indicated by horizontal arrows (above the map for the coding strand, and underneath the 
map for the non-coding strand). The transcription initiations sites are indicated with 
arrows. Scale bars are included for both maps. 

Figure 2. Functional implications of the RYl (-129) sequences in the trans- 
activation of the Ha dslO Gl promoter. Transient expression experiments carried out after 
bombarding the sunflower embryos with DNA coated micro-projectiles. The results of 5 
independent experiments, in which the different plasmid mixtures (described in Example 
1) where bombarded five times in each experiment, are presented. The average P- 
glucuronidase (GUS) activities normalised versus luciferase activity (LUC), as well as the 
standard error (indicated with bars), are presented. Key: F2, pSKdslOF2; F2ARY1, 
pSKdslOF2ARYl; ABI3, samples with the effector plasmid. A significant decrease in the 
relative GUS/LUC activity is observed, due to a mutation in the RYl box. The basal 
activities for pSKdslOFl (without including the effector plasmid) are of the order of 
46±8. 

Figure 3. Accumulation patterns of Ha dslO Gl gene mRNA in sunflower. The 
autoradiograph shown corresponds to the RNAse A protection tests, after hybridising a 
gene riboprobe with different total RNA samples. An accumulation of messenger RNA 
produced from Ha dslO Gl transcription initiation sites (as protected fragments indicated 
by the numbered arrows) is observed. These fragments are only detected in embryos 
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(Emb) from 10 to 20 dpa and in mature seeds (25 dpa), but not in other samples tested, 
such as seedhngs (Germ) or seedhngs treated with ABA (Germ + ABA). The carrier 
tRNA corresponds to control hybridisations with yeast tRNA. The bands corresponding to 
the mRNAs produced from the different initiation sites are indicated with numbers and 
arrows. The initiation site number 3 (indicated in parenthesis) has not been 
experimentally confirmed by primer extension. On the left margin are included molecular 
size markers (pBR322/Hpa HI). 

Figure 4. Localisation of mRNA in sunflower embryos sections at 12 (A and B), 
21 (C-E), and 28 dpa (F-H), The following riboprobes were used in each case: dslO (-), 
A, C, F, H; dslO (+),E, and 18S rRNA, B, D, G. Scale bar = 500 fim (Except in F, 125 
/xm). Palisade parenchyma= pp. The arrows mark the procambium. 

Figure 5. Restriction maps of dslO::GUS fusions and optimised expression dslO 
ECl cassette, constructed in Examples 3 and 4. The Ha dslO Gl and other genes 
contained in each case are shown by shading of varying intensity. The transcription 
initiation sites from the Ha dslO Gl promoter are marked with arrows. 

Figure 6. Expression of dslO::GUS fusions in tobacco transgenic plant seeds. 
Panel A: Summary of quantitative data (fluorimetric determinations). The average of 
GUS activities in transgenic plant seeds (TO), and its evolution through the different 
embryonic development stages is shown. The data corresponding to each fusion are 
indicated by the symbols in the upper left-hand insert. The bars indicate the standard 
errors. Panels B-E: representative selection with results of the histochemical GUS activity 
localisation experiments: B.- embryos at 12 dpa (plants dslOF2A, TO). C.- embryos and 
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endosperm at 16 dpa (dslOF2A plants, TO). D.- seedlings at 15 dpi under control 
conditions (dslOFl plants, Tl) E.- seedlings at 15 dpi under control conditions (dslOF3 
plants, Tl) In panels D and E, the arrows indicate the plant tissue without GUS activity 
(leaves and hypocotyl). 

5 

LIST OF SEQUENCES: 

SEQ No. 1: Ha dslO Gl gene nucleotide sequence. The transcription initiation 
sites experimentally determined (site 3, which has not been confirmed by primer 
extension is indicated in parenthesis) are indicated by arrows. The coding zone is shown 
10 by its amino acid translation indicated by the letter (LOl etc.) code underneath the 
nucleotide sequence. The termination codon is indicated by an asterisk. The sequence is 
numbered (on the left margin) starting from the initiation codon. The intron sequences are 
shown in lower case letters. The TATA box (in position -86) and RY box (-129 and -65) 
mentioned in the text (Example 1) are shown underlined. 
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